Comparison of 2D and 3D Analysis For Automated Cued Speech Gesture Recognition
Abstract
This paper deals with the automated classification of cued speech gestures. Cued speech is a specific gesture language (different from sign language) used for communication between deaf people and other people. It uses only 8 different hand configurations. The aim of this work is to apply a simple classifier to three image data sets in order to answer two main questions: is 3D data needed, and how important is the quality of the hand segmentation? The first data set consists of images acquired with a single camera in a controlled light environment and segmented using luminance information (called “2D segmentation”). The second data set is acquired with a 3D camera that can produce a depth map; the hand configurations are segmented using both the video and the depth map (called “3D segmentation”). The third data set consists of 3D-segmented masks in which the resulting hand mask is warped to compensate for hand pose variations. For classification, hand configurations are characterized by the seven Hu moment invariants. A supervised classification using a multi-layer perceptron is then performed. The classification performance obtained with 2D and 3D information is compared.
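As an illustration of the feature step named in the abstract, the following is a minimal sketch of how the seven Hu moment invariants could be computed from a binary hand mask. The abstract gives no implementation details, so the function name `hu_moments`, the list-of-rows mask format, and the small L-shaped test mask are assumptions made for this example only. It uses the standard textbook definition of the invariants, not necessarily the authors' code.

```python
def hu_moments(mask):
    """Return the seven Hu moment invariants of a binary mask.

    `mask` is a list of rows of 0/1 values (hypothetical input format).
    """
    # Coordinates of all foreground pixels.
    pts = [(x, y) for y, row in enumerate(mask) for x, v in enumerate(row) if v]
    m00 = float(len(pts))
    if m00 == 0:
        raise ValueError("mask is empty")
    # Centroid, used to make the moments translation invariant.
    xc = sum(x for x, _ in pts) / m00
    yc = sum(y for _, y in pts) / m00

    def eta(p, q):
        # Central moment mu_pq, normalized by area for scale invariance.
        mu = sum((x - xc) ** p * (y - yc) ** q for x, y in pts)
        return mu / m00 ** (1 + (p + q) / 2.0)

    n20, n02, n11 = eta(2, 0), eta(0, 2), eta(1, 1)
    n30, n03, n21, n12 = eta(3, 0), eta(0, 3), eta(2, 1), eta(1, 2)
    a, b = n30 + n12, n21 + n03
    c, d = n30 - 3 * n12, 3 * n21 - n03
    return [
        n20 + n02,
        (n20 - n02) ** 2 + 4 * n11 ** 2,
        c ** 2 + d ** 2,
        a ** 2 + b ** 2,
        c * a * (a ** 2 - 3 * b ** 2) + d * b * (3 * a ** 2 - b ** 2),
        (n20 - n02) * (a ** 2 - b ** 2) + 4 * n11 * a * b,
        d * a * (a ** 2 - 3 * b ** 2) - c * b * (3 * a ** 2 - b ** 2),
    ]


# Toy L-shaped mask and the same shape rotated by 90 degrees.
mask = [[1, 0, 0, 0],
        [1, 0, 0, 0],
        [1, 1, 1, 0]]
rotated = [list(row) for row in zip(*mask[::-1])]

features = hu_moments(mask)
features_rotated = hu_moments(rotated)
```

In this sketch the two feature vectors agree up to floating-point error, which is the property that makes such features attractive for hand shapes seen at different positions and in-plane orientations. A seven-element vector of this kind would then be the input to a supervised classifier such as the multi-layer perceptron mentioned in the abstract.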
Related resources
Hand Gesture Recognition from RGB-D Data using 2D and 3D Convolutional Neural Networks: a comparative study
Despite considerable progress in recognizing hand gestures from still images, many challenges remain in the classification of hand gestures in videos. The latter comes with additional challenges, including higher computational complexity and the arduous task of representing temporal features. Hand movement dynamics, represented by temporal features, have to be extracted by analyzing the total fr...
Perception and Synthesis of Biologically Plausible Motion: From Human Physiology to Virtual Reality
Temporal measures of hand and speech coordination during French cued speech production p. 13 Using signing space as a representation for sign language processing p. 25 Spatialised semantic relations in French sign language : toward a computational modelling p. 37 Automatic generation of German sign language glosses from German words p. 49 French sign language processing : verb agreement p. 53 R...
Fusion of children's speech and 2D gestures when conversing with 3D characters
Most existing multi-modal prototypes enabling users to combine 2D gestures and speech input are task-oriented. They help adult users solve particular information tasks often in 2D standard Graphical User Interfaces. This paper describes the NICE Andersen system, which aims at demonstrating multi-modal conversation between humans and embodied historical and literary characters. The target users ...
Cued Speech Gesture Recognition: A First Prototype Based on Early Reduction
Cued Speech is a specific linguistic code for hearing-impaired people. It is based on both lip-reading and manual gestures. In the context of THIMP (Telephony for the Hearing-IMpaired Project), we work on automatic Cued Speech translation. In this paper, we only address the problem of automatic Cued Speech manual gesture recognition. Such a gesture recognition issue is really common from a theo...
3D Hand Motion Evaluation Using HMM
Gesture and motion recognition are needed for a variety of applications. The use of human hand motions as a natural interface tool has motivated researchers to conduct research in the modeling, analysis and recognition of various hand movements. In particular, human-computer intelligent interaction has been a focus of research in vision-based gesture recognition. In this work, we introduce a 3-...
Publication date: 2004